How to Use Temporal-Driven Constrained Clustering to Detect Typical Evolutions
Abstract
In this paper, we propose a new time-aware dissimilarity measure that takes the temporal dimension into account: observations that are close in the description space but distant in time are considered dissimilar. We also propose a method to enforce segmentation contiguity by introducing, in the objective function, a penalty term inspired by the Normal Distribution Function. We combine the two proposals into a novel time-driven constrained clustering algorithm, called TDCK-Means, which creates a partition of clusters that are coherent both in the multidimensional space and in the temporal space. The algorithm uses soft semi-supervised constraints to encourage adjacent observations belonging to the same entity to be assigned to the same cluster. We apply our algorithm to a Political Studies dataset in order to detect typical evolution phases. We adapt the Shannon entropy to measure entity contiguity, and we show that our approach consistently improves the temporal cohesion of clusters without any significant loss in multidimensional variance.
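The abstract does not give the formulas, so the following is only a minimal Python sketch of the two ideas under stated assumptions. The function names, the multiplicative way of combining the descriptive and temporal distances, and the parameters `w_desc`, `w_time`, `beta` and `delta` are all hypothetical choices made for illustration; they are not taken from the paper and should not be read as the TDCK-Means definitions.

```python
import math


def time_aware_dissimilarity(x, t_x, y, t_y, max_desc_dist, max_time_dist,
                             w_desc=1.0, w_time=1.0):
    """Illustrative dissimilarity in [0, 1] (assumed form, not the paper's).

    Both squared distances are normalised by assumed maximum distances, so
    two observations with identical descriptions can still be maximally
    dissimilar when they are far apart in time.
    """
    d_desc = sum((a - b) ** 2 for a, b in zip(x, y)) / max_desc_dist ** 2
    d_time = (t_x - t_y) ** 2 / max_time_dist ** 2
    return 1.0 - (1.0 - w_desc * d_desc) * (1.0 - w_time * d_time)


def contiguity_penalty(t_x, t_y, beta=1.0, delta=1.0):
    """Gaussian-shaped weight (assumed form): equal to beta when the two
    timestamps coincide and decaying as the time gap grows."""
    return beta * math.exp(-0.5 * ((t_x - t_y) / delta) ** 2)


def soft_constraint_cost(entity_a, t_a, cluster_a,
                         entity_b, t_b, cluster_b,
                         beta=1.0, delta=1.0):
    """Cost added to the objective when two observations of the same
    entity end up in different clusters; zero in every other case."""
    if entity_a != entity_b or cluster_a == cluster_b:
        return 0.0
    return contiguity_penalty(t_a, t_b, beta, delta)


# Same description, ten years apart (the assumed maximum time distance):
# the observations are treated as fully dissimilar.
far_in_time = time_aware_dissimilarity([1.0, 2.0], 2000, [1.0, 2.0], 2010,
                                       max_desc_dist=5.0, max_time_dist=10.0)

# Splitting adjacent years of one entity costs more than splitting distant ones.
adjacent_split = soft_constraint_cost("A", 2000, 0, "A", 2001, 1, delta=2.0)
distant_split = soft_constraint_cost("A", 2000, 0, "A", 2008, 1, delta=2.0)
```

Because the penalty is soft, a large descriptive difference can still outweigh it, so an entity may change cluster when its description genuinely shifts; under these assumptions `delta` controls how quickly the pull between observations of the same entity fades with the time gap.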
Related resources
Spatio-temporal patterns of crab fisheries in the main bays of Guangdong Province, China
Using a semi-balloon otter trawl, crab fisheries in the main bays of Guangdong Province, China, were surveyed seasonally. A total of 70 species were found, all belonging to the South China Sea Faunal subregion in the tropical India-West-Pacific Faunal Region. The clustering and nMDS ordination analysis revealed the existence of three groups. Group 1 included Hailing Bay and four bays to ...
Repeated Record Ordering for Constrained Size Clustering
One of the main techniques used in data mining is data clustering, which has many applications in computer science, biology, and the social sciences. Constrained clustering is a type of clustering in which side information provided by the user is incorporated into existing clustering algorithms. One well-researched constrained clustering technique is microaggregation. In a microaggreg...
Multi-scale Community Detection in Temporal Networks Using Spectral Graph Wavelets
Spectral graph wavelets introduce a notion of scale in networks, and are thus used to obtain a local view of the network from each node. By carefully constructing a wavelet filter function for these wavelets, a multi-scale community detection method for monoplex networks has already been developed. This construction takes advantage of the partitioning properties of the network Laplacia...
A Data-driven Method for Crowd Simulation using a Holonification Model
In this paper, we present a data-driven method for crowd simulation with a holonification model. With this extra module, the accuracy of the simulation increases and the agents exhibit more realistic behaviors. First, we show how to use the concept of a holon in crowd simulation and how effective it is. For this purpose, we use simple rules for holonification. Using real-world data, we model the...
Journal title:
- International Journal on Artificial Intelligence Tools
Volume 23 Issue
Pages -
Publication date 2014